Compressed Vision for Efficient Video Understanding
Authors
Abstract
Experience and reasoning occur across multiple temporal scales: milliseconds, seconds, hours or days. The vast majority of computer vision research, however, still focuses on individual images or short videos lasting only a few seconds. This is because handling longer videos requires more scalable approaches even to process them. In this work, we propose a framework enabling research on hour-long videos with the same hardware that can now process second-long videos. We replace standard video compression, e.g. JPEG, with neural compression and show that we can directly feed compressed videos as inputs to regular video networks. Operating on compressed videos improves efficiency at all pipeline levels – data transfer, speed and memory – making it possible to train models faster and on much longer videos. Processing compressed signals has, however, the downside of precluding standard augmentation techniques if done naively. We address that by introducing a small network that can apply transformations to latent codes corresponding to commonly used augmentations in the original video space. We demonstrate that with our compressed vision pipeline, we can train video models more efficiently on popular benchmarks such as Kinetics600 and COIN. We also perform proof-of-concept experiments with new tasks defined over hour-long videos at standard frame rates. Processing such long videos is impossible without using a compressed representation.
Similar resources
Compressed-domain Object Detection for Video Understanding
In this paper, a novel algorithm for real-time, unsupervised object detection in compressed-domain sequences is proposed. The algorithm utilizes color and motion information present in the compressed stream as well as a simple object model. Extraction of the MPEG-7 dominant color descriptor, clustering of macroblocks to dominant color clusters and model-based cluster selection are employed ...
An Efficient Watermarking Scheme for H.264/AVC Compressed Video
Since H.264/AVC is the most widely-deployed video coding standard and has gained dominance, the necessity of copyright protection and authentication that are appropriate for this standard is unquestionable. According to H.264/AVC specific codec architecture, an efficient watermarking scheme for H.264/AVC video is proposed. The watermark information is embedded into quantized residual coefficien...
An Efficient Adaptive Boundary Matching Algorithm for Video Error Concealment
Sending compressed video data in error-prone environments (like the Internet and wireless networks) might cause data degradation. Error concealment techniques try to conceal errors in the received data at the decoder side. In this paper, an adaptive boundary matching algorithm is presented for recovering the damaged motion vectors (MVs). This algorithm uses an outer boundary matching or directional tempo...
Video Abstraction in H.264/AVC Compressed Domain
Video abstraction allows searching, browsing and evaluating videos by accessing only the useful content. Most studies work in the pixel domain, which requires decoding and consumes more time and processing than compressed-domain video abstraction. In this paper, we present a new video abstraction method in the H.264/AVC compressed domain, AVAIF. The method is based on the norm...
Journal
Journal title: Lecture Notes in Computer Science
Year: 2023
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-031-26293-7_40